Goto

Collaborating Authors

 pollution level


Veli: Unsupervised Method and Unified Benchmark for Low-Cost Air Quality Sensor Correction

Dalbah, Yahia, Worring, Marcel, Hsu, Yen-Chia

arXiv.org Artificial Intelligence

Urban air pollution is a major health crisis causing millions of premature deaths annually, underscoring the urgent need for accurate and scalable monitoring of air quality (AQ). While low-cost sensors (LCS) offer a scalable alternative to expensive reference-grade stations, their readings are affected by drift, calibration errors, and environmental interference. To address these challenges, we introduce Veli (Reference-free Variational Estimation via Latent Inference), an unsupervised Bayesian model that leverages variational inference to correct LCS readings without requiring co-location with reference stations, eliminating a major deployment barrier. Specifically, Veli constructs a disentangled representation of the LCS readings, effectively separating the true pollutant reading from the sensor noise. To build our model and address the lack of standardized benchmarks in AQ monitoring, we also introduce the Air Quality Sensor Data Repository (AQ-SDR). AQ-SDR is the largest AQ sensor benchmark to date, with readings from 23,737 LCS and reference stations across multiple regions. Veli demonstrates strong generalization across both in-distribution and out-of-distribution settings, effectively handling sensor drift and erratic sensor behavior. Code for model and dataset will be made public when this paper is published.


Forecasting and Visualizing Air Quality from Sky Images with Vision-Language Models

Vahdatpour, Mohammad Saleh, Eyvazi, Maryam, Zhang, Yanqing

arXiv.org Artificial Intelligence

Air pollution remains a critical threat to public health and environmental sustainability, yet conventional monitoring systems are often constrained by limited spatial coverage and accessibility. This paper proposes an AI-driven agent that predicts ambient air pollution levels from sky images and synthesizes realistic visualizations of pollution scenarios using generative modeling. Our approach combines statistical texture analysis with supervised learning for pollution classification, and leverages vision-language model (VLM)-guided image generation to produce interpretable representations of air quality conditions. The generated visuals simulate varying degrees of pollution, offering a foundation for user-facing interfaces that improve transparency and support informed environmental decision-making. These outputs can be seamlessly integrated into intelligent applications aimed at enhancing sit-uational awareness and encouraging behavioral responses based on real-time forecasts. W e validate our method using a dataset of urban sky images and demonstrate its effectiveness in both pollution level estimation and semantically consistent visual synthesis. The system design further incorporates human-centered user experience principles to ensure accessibility, clarity, and public engagement in air quality forecasting. T o support scalable and energy-efficient deployment, future iterations will incorporate a green CNN architecture enhanced with FPGA-based incremental learning, enabling real-time inference on edge platforms.


Assessing and Predicting Air Pollution in Asia: A Regional and Temporal Study (2018-2023)

Rahman, Anika, Khatun, Mst. Taskia

arXiv.org Artificial Intelligence

This study analyzes and predicts air pollution in Asia, focusing on PM 2.5 levels from 2018 to 2023 across five regions: Central, East, South, Southeast, and West Asia. South Asia emerged as the most polluted region, with Bangladesh, India, and Pakistan consistently having the highest PM 2.5 levels and death rates, especially in Nepal, Pakistan, and India. East Asia showed the lowest pollution levels. K-means clustering categorized countries into high, moderate, and low pollution groups. The ARIMA model effectively predicted 2023 PM 2.5 levels (MAE: 3.99, MSE: 33.80, RMSE: 5.81, R: 0.86). The findings emphasize the need for targeted interventions to address severe pollution and health risks in South Asia.


Applying IRT to Distinguish Between Human and Generative AI Responses to Multiple-Choice Assessments

Strugatski, Alona, Alexandron, Giora

arXiv.org Artificial Intelligence

Generative AI is transforming the educational landscape, raising significant concerns about cheating. Despite the widespread use of multiple-choice questions (MCQs) in assessments, the detection of AI cheating in MCQ-based tests has been almost unexplored, in contrast to the focus on detecting AI-cheating on text-rich student outputs. In this paper, we propose a method based on the application of Item Response Theory (IRT) to address this gap. Our approach operates on the assumption that artificial and human intelligence exhibit different response patterns, with AI cheating manifesting as deviations from the expected patterns of human responses. These deviations are modeled using Person-Fit Statistics (PFS). We demonstrate that this method effectively highlights the differences between human responses and those generated by premium versions of leading chatbots (ChatGPT, Claude, and Gemini), but that it is also sensitive to the amount of AI cheating in the data. Furthermore, we show that the chatbots differ in their reasoning profiles. Our work provides both a theoretical foundation and empirical evidence for the application of IRT to identify AI cheating in MCQ-based assessments.


Detecting Elevated Air Pollution Levels by Monitoring Web Search Queries: Deep Learning-Based Time Series Forecasting

Lin, Chen, Yousefi, Safoora, Kahoro, Elvis, Karisani, Payam, Liang, Donghai, Sarnat, Jeremy, Agichtein, Eugene

arXiv.org Artificial Intelligence

Real-time air pollution monitoring is a valuable tool for public health and environmental surveillance. In recent years, there has been a dramatic increase in air pollution forecasting and monitoring research using artificial neural networks (ANNs). Most of the prior work relied on modeling pollutant concentrations collected from ground-based monitors and meteorological data for long-term forecasting of outdoor ozone, oxides of nitrogen, and PM2.5. Given that traditional, highly sophisticated air quality monitors are expensive and are not universally available, these models cannot adequately serve those not living near pollutant monitoring sites. Furthermore, because prior models were built on physical measurement data collected from sensors, they may not be suitable for predicting public health effects experienced from pollution exposure. This study aims to develop and validate models to nowcast the observed pollution levels using Web search data, which is publicly available in near real-time from major search engines. We developed novel machine learning-based models using both traditional supervised classification methods and state-of-the-art deep learning methods to detect elevated air pollution levels at the US city level, by using generally available meteorological data and aggregate Web-based search volume data derived from Google Trends. We validated the performance of these methods by predicting three critical air pollutants (ozone (O3), nitrogen dioxide (NO2), and fine particulate matter (PM2.5)), across ten major U.S. metropolitan statistical areas (MSAs) in 2017 and 2018.


Deciphering Environmental Air Pollution with Large Scale City Data

Bhattacharyya, Mayukh, Nag, Sayan, Ghosh, Udita

arXiv.org Artificial Intelligence

Out of the numerous hazards posing a threat to sustainable environmental conditions in the 21st century, only a few have a graver impact than air pollution. Its importance in determining the health and living standards in urban settings is only expected to increase with time. Various factors ranging from emissions from traffic and power plants, household emissions, natural causes are known to be primary causal agents or influencers behind rising air pollution levels. However, the lack of large scale data involving the major factors has hindered the research on the causes and relations governing the variability of the different air pollutants. Through this work, we introduce a large scale city-wise dataset for exploring the relationships among these agents over a long period of time. We analyze and explore the dataset to bring out inferences which we can derive by modeling the data. Also, we provide a set of benchmarks for the problem of estimating or forecasting pollutant levels with a set of diverse models and methodologies. Through our paper, we seek to provide a ground base for further research into this domain that will demand critical attention of ours in the near future.


Scientists turn to deep learning to improve air quality forecasts

#artificialintelligence

Air pollution from the burning of fossil fuels impacts human health but predicting pollution levels at a given time and place remains challenging, according to a team of scientists who are turning to deep learning to improve air quality estimates. Results of the team's study could be helpful for modelers examining how economic factors like industrial productivity and health factors like hospitalizations change with pollution levels. "Air quality is one of the major issues within an urban area that affects people's lives," said Manzhu Yu, assistant professor of geography at Penn State. "Yet existing observations are not adequate to provide comprehensive information that may help vulnerable populations to plan ahead." Satellite and ground-based observations each measure air pollution, but they are limited, the scientists said.


UK to combat air pollution with AI-powered traffic lights

#artificialintelligence

The UK plans to tackle pollution with AI-powered traffic lights that delay the arrival of vehicles in toxic air hotspots. The system collects data on local pollution and traffic flows through roadside sensors, weather forecasts, and Bluetooth devices in cars. An algorithm then analyzes both live and historical data to predict where air pollution will spike within the next hour. When the system forecasts a sharp rise in toxic pollutants, the traffic light timings will automatically change. Drivers on their way to pollution hotspots will be held at red lights for up to 20 seconds longer than usual.


AI system can predict air pollution before it happens

#artificialintelligence

Air pollution kills an estimated seven million people every year and cities around the world are being forced to take action to do what they can to lower the risk to inhabitants. A team of Loughborough University computer scientists believe their AI system has the potential to provide new insight into the environmental factors that have significant impacts on air pollution levels. In particular it focuses on the amount of'PM2.5' In 2013, a study involving 312,944 people in nine European countries revealed that there was no safe level of particulates. PM2.5 particulates were found to be particularly deadly, blamed for a 36 per cent increase in lung cancer per 10 μg/m3 as they can penetrate deep into the lungs.


A novel artificial intelligence system that predicts air pollution levels

#artificialintelligence

Imagine being scared to breathe the air around you. An unusual concept for us here in the UK, but it is a genuine concern for communities all over the world with air pollution killing an estimated seven million people every year. A team of Loughborough University computer scientists are hoping to help eradicate this fear with a new artificial intelligence (AI) system they have developed that can predict air pollution levels hours in advance. The technology is novel for a number of reasons, one being that it has the potential to provide new insight into the environmental factors that have significant impacts on air pollution levels. Professor Qinggang Meng and Dr. Baihua Li are leading the project which is focused on using AI to predict PM2.5--particulate matter of less than 2.5 microns (10-6 m) in diameter--that is often characterized as reduced visibility in cities and hazy-looking air when levels are high.